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ABSTRACT 

We present a study on the clustering of a stellar mass selected sample of 18,482 galaxies with 
stellar masses Af* > at redshifts 0.4 < z < 2.0, taken from the Palomar Observa- 

tory Wide-field Infrared Survey. We examine the clustering properties of these stellar mass 
selected samples as a function of redshift and stellar mass, and discuss the implications of 
measured clustering strengths in terms of their likely halo masses. We find that galaxies with 
high stellar masses have a progressively higher clustering strength, and amplitude, than galax- 
ies with lower stellar masses. We also find that galaxies within a fixed stellar mass range have 
a higher clustering strength at higher redshifts. We furthermore use our measured clustering 
strengths, combined with models from Mo & White (2002), to determine the average total 
masses of the dark matter haloes hosting these galaxies. We conclude that for all galaxies in 
our sample the stellar-mass-to-total-mass ratio is always lower than the universal baryonic 
mass fraction. Using our results, and a compilation from the literature, we furthermore show 
that there is a strong correlation between stellar-mass-to-total-mass ratio and derived halo 
masses for central galaxies, such that more massive haloes contain a lower fraction of their 
mass in the form of stars over our entire redshift range. For central galaxies in haloes with 
masses A/haio > Mq we find that this ratio is < 0.02, much lower than the universal 

baryonic mass fraction. We show that the remaining baryonic mass is included partially in 
stars within satellite galaxies in these haloes, and as diffuse hot and warm gas. We also find 
that, at a fixed stellar mass, the stellar-to-total-mass ratio increases at lower redshifts. This 
suggests that galaxies at a fixed stellar mass form later in lower mass dark matter haloes, and 
earlier in massive haloes. We interpret this as a "halo downsizing" effect, however some of 
this evolution could be attributed to halo assembly bias. 

Key words: galaxies: evolution - galaxies: high-redshift - cosmology: observations - large- 
scale structure of Universe. 



1 INTRODUCTION 

Astronomers in the last decade have made major progress in un- 
derstanding the properties and evolution of galaxies in the distant 
Universe. Galaxies up to redshifts 2 ~ 6, and perhaps even higher, 
have been discovered in large numbers allowing statistically signif- 
icant population characteristics to be derived (e.g. Bouwens et al. 
2007; McLure et al. 2009). We in fact now have a good under- 
standing of basic galaxy quantities, such as the luminosity function 
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(e.g. Drory et al. 2003; Ilbert et al. 2005; Cirasuolo et al. 2007, 
2010), as well as how scaling relations, such as the TuUy-Fisher 
relation evolve up to, at least, z ~ 1.2 (e.g. Bohm et al. 2004; Con- 
selice et al. 2005b; Bamford et al. 2006; Weiner et al. 2006; Kassin 
et al. 2007; Bohm & Ziegler 2007; Chiu et al. 2007; Puech et al. 
2008; Fernandez Lorenzo et al. 2009). We have also begun to trace 
the stellar mass evolution of galaxies, as well as the star formation 
rate, determining when stars and stellar mass was put into place in 
the modem galaxy population (e.g. Borch et al. 2006; Bundy et al. 
2006; Fontana et al. 2006; Marchesini et al. 2009). 

Stellar masses are now becoming a standard measure for 
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galaxies, and are being used to trace the evolution of the galaxy 
population in terms of star formation rates and morphologies (e.g. 
Bundy et al. 2005; Conselice et al. 2005a, 2008; Noeske et al. 2007; 
Bell et al. 2007; Cowie & Barger 2008). However, stellar masses 
only trace one aspect of the masses of galaxies, and ideally and ul- 
timately, we want to be able to measure galaxy total masses, that 
include contributions from stars, gas, and dark matter. Galaxies are 
believed to be hosted by massive dark matter haloes that make up 
more than 85% of their total masses, and thus clearly tracing the 
co-evolution of galaxies and their haloes is a major and important 
goal. 

Measuring dynamical or total masses for galaxies is, however, 
very difficult, as it requires observations that are very challenging 
to obtain, or requires unusual and rare circumstances such as grav- 
itational lensing. For example, dynamical masses can be measured 
through rotation curves, or internal velocity dispersions, but this 
becomes difficult at higher redshifts, with reliable internal veloci- 
ties existing for only a handful of galaxies at z ~ 1.0 and above 
(e.g. Conselice et al. 2005b; Forster Schreiber et al. 2006; Swin- 
bank et al. 2006; Bohm & Ziegler 2007; van Starkenburg et al. 
2008). Furthermore, it is difficult to know whether these kinematic 
measures are tracing the total potential, or just the inner parts of 
galaxies. Likewise, using gravitational lensing to measure galaxy 
total masses is difficult, as it requires lensed background galaxies, 
and such instances are very rare. It is also not yet clear if these 
galaxies are representative, as they may have special mass profiles 
that are conducive to lensing. The weak galaxy-galaxy lensing tech- 
nique provides another way to estimate the total masses of galax- 
ies. No unusual circumstances are required, but individual measure- 
ments are extremely difficult to achieve. Stacking techniques give 
reliable results, but involve combining galaxies in haloes of differ- 
ent masses, which complicates the interpretation (e.g. Mandelbaum 
et al. 2006). Stacked satellite kinematics can also be used to probe 
the total masses of galaxies, but face the same problems as the weak 
galaxy-galaxy lensing method (More et al. 2009). Therefore, ob- 
taining total mass estimates for galaxies is difficult, and very few 
measures, or even estimates, have been produced for galaxies out- 
side the local Universe. 

One very powerful method for measuring the total masses of 
galaxies is to measure their clustering. Clustering measurements 
are independent of photometric properties, and as such they can 
be used to highlight fundamental properties of galaxy populations 
without assumptions concerning stellar populations or mass pro- 
files. Previously, it has been shown that galaxies with higher stellar 
masses cluster more than systems with lower stellar mass, with a 
very strong clustering above the characteristic stellar mass M* of 
the Schechter mass function (Li et al. 2006). In the halo model of 
galaxy formation, the large-scale distribution of galaxies is deter- 
mined by the distribution of dark matter haloes (Mo & White 2002). 
Therefore, halo clustering is a strong function of halo mass, with 
more massive haloes more strongly clustered, providing a means to 
study the relationship between galaxy properties, and dark matter 
halo masses. In fact, basic calculations allow one to convert a clus- 
tering strength for a sample of galaxies into a corresponding halo 
mass in which galaxies reside (e.g. Ouchi et al. 2004; Magliocchetti 
et al. 2008; Yoshida et al. 2008). 

Clustering measurements have been performed on galaxy 
samples in both the nearby and distant Universe. In the local Urri- 
verse, studies have established that clustering depends on the type 
of galaxy under consideration; for example, early-type, red, galax- 
ies are more clustered than late-type, blue galaxies (e.g. Guzzo 
et al. 1997; Norberg et al. 2002; Croton et al. 2007b; Cresswell 



& Percival 2009), and Ivuninous galaxies are more clustered than 
faint galaxies (e.g. Norberg et al. 2001; Zehavi et al. 2005; Skibba 
et al. 2006). Moreover, distant massive galaxies selected by their 
extremely red colours have been shown to strongly cluster, and 
therefore are thought to inhabit massive dark matter haloes (e.g. 
Foucaud et al. 2007; Quadri et al. 2007; Magliocchetti et al. 2008; 
Hartley et al. 2008). 

In this paper, we present the first general study of the clus- 
tering properties for a stellar mass selected sample of galaxies up 
to a ~ 2. We carry this out by measuring the correlation length 
and amplitude for galaxies selected with stellar masses M, > 
lO^"Af0 within the Palomar Observatory Wide-field Infrared Sur- 
vey (POWIR) (Bundy et al. 2006; ConseUce et al. 2008), which 
covers the fields where spectroscopy and other multi-wavelength 
data are available through the Deep Extragalactic Evolutionary 
Probe 2 (DEEP2) survey and All-wavelength Extended Groth strip 
International Survey (AEGIS) (Davis et al. 2003, 2007). In total 
we examine the clustering strength for 18,482 galaxies selected by 
stellar mass within 0.7 deg^. We derive an estimate of the total 
masses for these galaxies, and study in detail their stellar-mass-to- 
total-mass ratio. 

This paper is presented as follows: our data-sets, catalogues 
and our photometric redshifts and stellar mass estimations are de- 
scribed in Section 2; in Section 3 we describe the methods used 
in measuring the clustering properties of our samples; Section 4 is 
dedicated to the methods we use to derive the masses of dark matter 
haloes for our samples; in Section 5 we compare our results with 
the literature and models; and finally we summerise our conclu- 
sions in Section 6. Throughout the paper, we assume a ACDM cos- 
mology with = 0.3, r^A = 0.7, h = Ho/70 km s"^ Mpc~^ 
To ease comparisons with previous work, we use a concordance 
model with fiducial values of ris = 1.0 and as = 0.9. To deter- 
mine stellar masses throughout this paper, we use the Initial Mass 
Function (IMF) from Chabrier (2003) and assume a Hubble con- 
stant of i/o = 70 km s"^ Mpc"\ 



2 THE PALOMAR/DEEP2 SURVEY 
2.1 Data sets 

AU of the galaxies in this paper are found within three of the four 
fields covered by the Palomar Observatory Wide-Field Infrared 
Survey (POWIR, Table 1; Conselice et al. 2007). The POWIR sur- 
vey was designed to obtain deep i^-band and J-band data over 
a significant area (~1.5 deg^). Observations were carried out be- 
tween September 2002 and October 2005 over a total of ~ 70 
nights. This survey covers the Extended Groth Strip (Davis et al. 
2007), and three other fields that the DEEP2 team has observed 
with the DEIMOS spectrograph (Davis et al. 2003). The total area 
imaged in the i^T-band is 4920 arcmin^=1.37 deg^, with half of this 
area imaged in the J-band. The goal depth was K^.vcga = 21, al- 
though not all fields are covered to this depth, therefore we select 
the fields which have 5a depths between Ks.vcga = 20.2 - 21.5 
for point sources, measured in a 2" diameter aperture. For our pur- 
poses we abbreviate the fields covered as: EOS (Extended Groth 
Strip), Field 2, Field 3, and Field 4 (Table 1). In the following study 
we use data in the EGS field only to train our photometric redshifts 
(see section 2.2.1), while only the other three fields are used to 
perform our clustering analysis (see section 3.1). For extensive in- 
formation on this survey, and the data products we use from it, see 
Bundy et al. (2006), Conselice et al. (2007) and Conselice et al. 
(2008). 
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Field 


RA 


Dec. 


K-band area 


J-band area 








(arcmin^) 


(arcmin^) 


EGS 


14 17 00 


+52 30 00 


2165 


656 


Field 2 


16 52 00 


+34 55 00 


787 





Field 3 


23 30 00 


+00 00 00 


984 


984 


Field 4 


02 30 00 


+00 00 00 


984 


787 



Table 1. The Palomar Fields and WIRC pointings areas. The EGS field is 
used in this study for training the photometric redshifts, and the other three 
for measuring clustering properties. 

All of the ifs-band data were acquired utilising the Wide 
Field Infrared Camera (WIRC) on the Palomar 5 meter telescope. 
WIRC has an effective field of view of 8.1' x 8.1', with a pixel 
scale of 0.25"pixen^. The total survey contains 75 WIRC point- 
ings. During the ii's-band observations we used 30 second integra- 
tions, with four exposures per pointing. The J-band observations 
were taken with 120 second exposures per pointing. Typical total 
exposure times were between one and two hours for both bands. 
The reduction procedure follows a standard method for combin- 
ing near-infrared (NIR) ground-based imaging, and is described in 
more detail in Bundy et al. (2006). The resulting seeing FWHM 
in the ATs-band imaging ranges from 0.8" to 1.2", and is typically 
1.0". Photometric calibration was carried out by referencing Pers- 
son standard stars during photometric conditions, which were later 
cross calibrated with 2MASS stars (Skrutskie et al. 2006). 

The final NIR images were made by combining individual mo- 
saics obtained over different nights. The ATs-band mosaics are com- 
prised of coadditions of 4 x 30 seconds exposures dithered over 
a non-repeating 7.0" pattern. The images were processed using a 
double-pass reduction pipeline developed specifically for WIRC. 
The detection and photometry of our galaxies was performed using 
the SExtractor package (Bertin & Arnouts 1996). False artifacts 
are removed through SExtractor flags which identify sources that 
do not have normal galaxy or stellar profiles. From this we built 
a if-selected sample and then cross-referenced these sources with 
the DEEP2 redshift catalogue. 

Optical imaging from the Canada-France-Hawaii Telescope 
(CFHT), over all fields, is used to estimate photometric redshifts 
and stellar masses, with the help of spectroscopy from the DEIMOS 
spectrograph on the Keck n telescope (Faber et al. 2003). This op- 
tical imaging comes from the CFHT 3.6-m, and consists of data in 
the B-, R- and /-bands taken with the CFH12K camera - a 12,288 
X 8,192 pixels CCD mosaic with a pixel scale of 0.21". The inte- 
gration times for these observations are 1 hour in B and R, and 2 
hours in /, per pointing, with a 7i-band 5a depth of Rab = 25.1, 
and similar depths in B and I (Coil et al. 2004; Conselice et al. 
2007, 2008). From this imaging data a Rab = 24.1 magnitude 
limit was used for determining targets for the DEEP2 spectroscopy. 
The seeing for the optical imaging is roughly the same as that for 
the NIR imaging, and we measure photometry consistently, using a 
2" diameter aperture. 

The Keck spectra were acquired with the DEIMOS spectro- 
graph as part of the DEEP2 redshift survey (Davis et al. 2003). The 
selection of targets for the DEEP2 spectroscopy was based on the 
optical properties of the galaxies detected in the CFHT photome- 
try, with the basic selection criteria Rab < 24.1. Objects in Fields 
2, 3 and 4 were selected for spectroscopy based on their position 
in {B — R) vs. {R — I) colour space to focus on galaxies at red- 
shifts z > 0.7. Spectroscopy in the EGS was acquired using the 
magnitude limit, but to avoid the survey being completely domi- 



nated by lower redshift galaxies, sources with colours suggesting 
they are at z < 0.7 were down- weighted in favour of sources with 
z > 0.7. The total survey targeted over 30,000 galaxies, with about 
a third of these in the EGS field. In all fields the sampling rate 
for galaxies that meet the selection criteria is 60%. The DEIMOS 
spectroscopy was obtained using the 1200 line/mm grating, with a 
resolution R ~ 5000 covering the wavelength range 6500 - 9100 
A. Redshifts were measured by comparing templates to data, and 
we only utilise those redshifts determined by the identification of 
two or more lines, providing very secure measurements. 

Since accurate photometry, and photometric errors are impor- 
tant for the purposes of this paper we therefore give some details 
about our methods of measuring magnitudes. The photometric er- 
rors, and the detection limit of each i^T-band image, are estimated 
by randomly inserting simulated galaxies of known magnitude, sur- 
face brightness profile, and size into each image, and then recov- 
ering these simulated objects with the same detection parameters 
used for real objects (see Bundy et al. 2006; Conselice et al. 2008). 
To determine the detection and photometric fidelity of the images 
in more detail, two sets of 10,000 mock galaxies were created, each 
with an intrinsic exponential and de Vaucouleurs profile (see Con- 
selice et al. 2007). 

Systematic errors in the measurement of magnitudes, due to 
the detection method, were also estimated using the same simula- 
tions. The recovery fraction is found to be essentially 100% at the 
magnitudes of our sample galaxies. This was determined through 
simulating galaxies down to K = 21, as detailed in Conselice et al. 
(2007). However, all of the galaxies in our sample are at least a 
magnitude brighter than this, with all galaxies within our stellar 
mass limits having magnitudes if < 20 as shown in Conselice 
et al. (2007). Trujillo et al. (2007) and Conselice et al. (2007) fur- 
thermore carried out extensive simulations to show that at this Umit 
we are nearly 100% complete and are retrieving nearly all of the 
light from these galaxies. However, depending on the sizes of the 
galaxies, as much as 0.2 mag in the recovered Jf-band light could 
be missed, although for any single galaxy it is unlikely that this 
amount of light is missed due to the surface brightness profiles of 
our objects (Trujillo et al. 2007). This issue is addressed in great 
detail in Conselice et al. (2007), but in summary we account for 
this uncertainty in the error budget. 



2.2 Redshifts and Stellar Masses 

2.2.1 Photometric Redshifts 

We calculate photometric redshifts for our if-selected galaxies 
which do not have DEEP2 spectroscopy. These photometric red- 
shifts are based on the optical+near infrared imaging, in the 
BRIJK (or BRIK for half the sample) bands, and are fit in 
two ways, depending on the brightness of a galaxy in the optical. 
For galaxies that meet the spectroscopic criteria, -Rab < 24.1, we 
utilise a neural network photometric redshift technique to take ad- 
vantage of the large number of secure redshifts with similar pho- 
tometric data. Most of the Rab < 24.1 sources not targeted for 
spectroscopy should be within z < 1.4, as very few objects this 
bright are found at higher redshifts (e.g. Steidel et al. 2004). The 
neural network fitting is done through the use of the ANNz (Col- 
lister & Lahav 2004) method and code. To train the code, we use 
the ~ 5000 redshifts in the EGS field, whose galaxies are selected 
with a magnitude limit of i?AB < 24.1 and span the redshift range 
0.4 < z < 1.4 (see section 2.1). We then use this training to cal- 
culate the photometric redshifts for galaxies with Rab < 24.1 
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in all fields. The overall agreement between our photometric red- 
shifts and our ANNz spectroscopic redshifts is very good using this 
technique, with 5z/{l + z) = 0.07 out to 2; ~ 1.4. The agree- 
ment is even better for the M« >10^^Mo galaxies where we find 
6z/{l + z) = 0.025 across all of our four fields. The photometry 
we use for our photometric redshift estimates are measured within 
a 2" diameter aperture. 

For galaxies fainter than Rab = 24.1 we estimate photo- 
metric redshifts using Bayesian techniques based on the software 
from Bemtez (2000). For an object to have a photometric red- 
shift we require that it be detected at the 5cr level in all optical 
and near-infrared bands (BRIJK), which in the _R-band reaches 
Rab = 25.1 (Coil et al. 2004; Consehce et al. 2007, 2008). We 
optimise our results, and correct for systematics, through a compar- 
ison with spectroscopic redshifts, resulting in a redshift accuracy of 
6z/{l + z) = 0.17 for _Rab > 24.1 systems based on comparisons 
to redshifts from Reddy et al. (2006). These Rab > 24.1 galaxies 
are however only a very small part of our sample. Up to 2; ~ 1.4 
only 6 (2.6%) of our M« ylO^'^'^MQ galaxies are in this regime, 
while 412 (9%) of our lO^Mg < Af, <10" '^Mq galaxies have 
an _R-band magnitude this faint, with a similar fraction for galaxies 
down to M» = 10^'' Mq . Furthermore, all of these systems are at 
z > l.Atz > 1.4 all of our sample galaxies are measured through 
the Bemtez (2000) method due to a lack of training redshifts. We 
also compared the results at low redshifts between the Collister & 
Lahav (2004) and Benftez (2000) methods and their agreement is 
very good, at the same level than the comparison with spectroscopic 
redshifts. 

A thorough explanation, including error budgets and uncer- 
tainties, for our redshifts are included in Conselice et al. (2007) 
and Conselice et al. (2008). 



2.2.2 Stellar Masses 

We match our K-bemd selected catalogues to the CFHT optical data 
to obtain spectral energy distributions (SEDs) for all of our sources, 
resulting in measured BRIJK magnitudes. From these we com- 
pute stellar masses based on the methods and results outlined in 
Bundy et al. (2005), Bundy et al. (2006) and Conselice et al. (2007). 

The basic mass fitting method consists of fitting a grid of 
model SEDs constructed from Bruzual & Chariot (2003) (BC03) 
stellar population synthesis models, with different star formation 
histories. We use exponentially declining models to characterise the 
star formation history, with various ages, metallicities and dust con- 
tents included. These models are parameterised by an age, and an 
e-folding time for parameterising the star formation history, where 
SFR ae-r , The values of t are randomly selected from a range be- 
tween 0.01 and 10 G5T, while the age of the onset of star formation 
ranges from to 10 Gyr. The metallicity ranges from 0.0001 to 0.05 
(BC03). and the dust content is parametrised by rv, the effective 
V-band optical depth for which we use values ry ~ 0.0, 0.5, 1, 2. 
Although we vary several parameters, the resulting stellar masses 
from our fits do not depend strongly on the various selection cri- 
teria used to characterise the age and the metallicity of the stellar 
population. 

We match magnitudes derived from these model star forma- 
tion histories to the actual data to obtain a measurement of stel- 
lar mass using a Bayesian approach. We calculate the likely stel- 
lar mass, age, and absolute magnitudes for each galaxy at all star 
formation histories, and determine stellar masses based on this dis- 
tribution. Distributions with larger ranges of stellar masses have 



larger resulting uncertainties. Typical errors for our stellar masses 
are 0.2 dex from the width of the probability distributions. There 
are also uncertainties from the choice of the Initial Mass Function 
(IMF). Our stellar masses utilise the Chabrier (2003) IMF, which 
can be converted to Salpeter IMF stellar masses by dividing by a 
factor ~ 1.5 (see Chabrier 2003). There are additional random un- 
certainties due to photometric errors. The resulting stellar masses 
thus have a total random error of 0.2-0.3 dex, roughly a factor of 
two. The details behind these mass measurements and their uncer- 
tainties, including the problem of thermal-pulsating AGB stars, is 
described in Bundy et al. (2006) and Conselice et al. (2007). We ex- 
amine the changes in our stellar masses by using Bruzual & Chariot 
models updated in 2007 to include TP-AGB stars, where we find 
at most a decrease of 0.07 dex, or stellar masses which are 20% 
less than what we calculate using models without TP-AGB stars. 
Furthermore, Conroy et al. (2009) have recently shown that stellar 
mass estimates, from stellar population synthesis, suffer systematic 
uncertainties mainly due to key phases of stellar evolution and the 
initial mass function. However we are consistent in our analysis as 
we are comparing results from different studies based on the same 
stellar population models to determine stellar masses. Throughout 
this work, stellar mass is given in unit of Mq , having been com- 
puted assuming a Hubble constant of Hq = 70 km s~^ Mpc~^. 

In this study we only examine massive galaxies with stel- 
lar masses M, > lO^"A/0 . In fact, we examine the cluster- 
ing properties of galaxies within the following four mass selected 
samples 10^° ° Mq < ^ lO^ '^M© , lO^'^ '^Mo < M^ < 
10" °Mo, 10" °Afo< M» ^ 10"-^M©and 10"-°Mo < 
M* ^ 1O^2 °M0 . We also examine these stellar mass cuts within 
four different redshift bins: 0.4 < z ^ 0.8, 0.8 < z ^ 1.2, 
1.2 < z ^ 1.6 and 1.6 <« < 2.0. As discussed in Conselice 
et al. (2007), our lower-mass samples are largely incomplete in the 
higher redshift bins, and the volume we are sampling at lower red- 
shift does not allow us to examine a large enough sample of massive 
galaxies. The abundances of our different samples are reported in 
Table 2. We also note that our final two stellar-mass bins are not 
independent, as the third one overlaps completely with the fourth 
one. We decided to keep the bins this way for statistical reasons, 
as otherwise it would have been impossible to conduct studies on 
an independent higher stellar mass bin within the highest redshift 
bins. As we will see later, the results for our two higher mass bins 
are very similar, and this division does not effect our analysis. 

2.2.3 Photometric Redshift Errors 

In order to better constrain the errors introduced in our clustering 
and abundance analysis due to possible inaccurate photometric red- 
shifts, we built a set of five supplementary simulated catalogues 
based on a series of Monte Carlo simulations. We perform this sim- 
ulation by altering the redshifts of each of the galaxies in our sam- 
ple, randomly within our well calibrated and known photometric 
redshift errors, as based on comparisons to spectroscopic redshifts. 
The new stellar masses are then estimated with these altered red- 
shifts for each object. 

After recalculating these masses from our simulated redshifts, 
we then rc-sclcct the galaxies within our different redshift and stel- 
lar masses bins as in our original analysis, but by using these new 
simulated galaxy catalogues. The abundances of our different sam- 
ples reported in Table 2, are an average over all the catalogues (the 
original one and the five altered ones), and an average over our 
three fields. The error budget listed in this table takes into account 
the errors introduced by the photometric redshift, and the resulting 
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stellar mass errors, as accounted for by the rms over the six cata- 
logues, plus the errors introduced by the cosmic variance as mea- 
sured from the field-to-field variance over our three fields. Within 
this paper we systematically estimate our errors by accounting for 
these two effects. This is likely an over-estimation of the total er- 
ror, as the error on the photometric redshifts are partially included 
in the field-to-field variance. 

We also use these catalogues to estimate the fraction of in- 
terlopers that are contaminating each of our redshift bins. As the 
determination of stellar mass is directly linked with the photomet- 
ric redshift determination, it is very difficult to conduct an inter- 
loper analysis for both selections at the same time. We therefore 
focus on the interlopers between the different redshift bins, which 
is likely to have the highest impact on our clustering analyses. Fur- 
thermore, our measurements are made on three independent fields, 
all of which give similar results. Conselice et al. (2007) also per- 
formed a Monte-Carlo simulation in order to quantify the effect of 
stellar mass errors on the density and mass function of our samples. 
They found that the systematic effect is minor compared to the ef- 
fect of the cosmic variance, which we also find, and for which we 
account for as well. 

The interloper fraction within the different redshift bins is de- 
termined by identifying objects that are within the same redshift bin 
in all our simulated catalogues. Then the number of potential inter- 
lopers is identified in each catalogue based on the difference be- 
tween this standard set and the additional number of galaxies seen 
in each. However, this method of accounting for the photometric 
redshift errors increases artificially the number of interlopers, from 
the most populated redshift bins to the least populated. By com- 
paring our five simulated catalogues with our original one, we find 
that this effects account for ~ 10% of the galaxies in the simu- 
lated catalogues. Taking into account this effect, we estimate that 
the maximum possible interloper fraction is: for 0.4 < z < 0.8, 
/ = 21.6 ± 1.8%, for 0.8 < 2 < 1.2, / = 19.8 ± 1.7%, for 
1.2 < 2 < 1.6, / = 21.2 ± 2.0%, and for 1.6 < 2 < 2.0, 
/ = 29.2 ± 5.2%. 

Our interloper fraction is therefore at most between 20% and 
30% depending on the redshift bin. Given the errors on our photo- 
metric redshift determination, as quoted in Section 2.2.1, this is of 
the order we expect. Furthermore, we note that this fraction of in- 
terlopers is largely due to galaxies near the boundaries of our strict 
stellar mass and redshift cuts entering other bins. These galaxies 
typically have just slightly different redshifts and stellar masses be- 
fore they were simulated, creating a slightly different population 
within each simulated redshift bin which is not significantly dif- 
ferent from the original one. Regardless, we fully account for this 
effect within our measured clustering strengths error budget. 



3 CLUSTERING PROPERTIES OF MASSIVE GALAXIES 
3.1 Angular clustering 

The primary goal of this work is to study the clustering properties 
of galaxies selected according to their stellar mass. A clustering 
analysis is a study of the distribution of galaxies at all scales. Given 
its peculiar geometry, the EGS field is not ideally suited for such a 
study. Indeed, as galaxies are distributed along a strip in the EGS, 
the clustering properties at the largest scales are difficult to estab- 
lish given the small numbers of objects we have in each stellar mass 
and redshift bin in this field. For the other three fields we mask out 
overlap regions, and other regions which do not have reliable pho- 
tometry. This masking of regions empty of galaxies is then taken 



into account in the following analysis. Overall we performed our 
analyses over a total area of 0.7 deg^. 

Within our clustering analysis we first measure the 2-point an- 
gular correlation function oj{0) for our sample using the Landy & 
Szalay (1993) estimator: 



DD - 2DR + RR 
RR 



(1) 



where the DD, DR and RR terms refer to the number of data-data, 
data-random and random-random galaxy pairs having angular sep- 
arations between and 9 + 50. Figure 1 shows the correlation func- 
tion derived from our mass-selected samples in different redshift 
bins for one of our fields. 

The best fit for the angular correlation is assumed to be a 
power-law of the form (Groth & Peebles 1977): 



(2) 



with Au the ampUtude at 1 degree, 5 the slope, and Cs the inte- 
gral constraint due to the limited area of the survey. For statistical 
reasons we fix the slope to the commonly used fiducial value of 
6 = 0.8 (Groth & Peebles 1977). We also estimate the integral 
constraint (Cg) as follows (Roche et al. 1993), 



Cs 



d£lid£l2 



(3) 



where Q, is the area subtended by the survey field. To determine Cs 
we mmierically integrate this expression over each field, excluding 
masked regions. Assuming a slope of 5 = 0.8, the unmasked ar- 
eas (Sf) covered by the fields, and the corresponding integral con- 
straint values are: Sf2 = 639.55 arcmin^ and Co. 8 f2 = 3.61, 
Sf3 = 777.30 arcmin^ and Co.sfs = 3.76, Sfa = 731.04 
arcmin^ and Co.s f4 = 3.69, for fields 2, 3 and 4 respectively. 

In order to fit the clustering reliably, we avoid the small-scale 
excesses due to possible multiple galaxy occupation of a single dark 
matter halo, as has been shown to exist in recent studies (e.g. Ouchi 
et al. 2005). The lower bound on which we fit our correlation was 
chosen to correspond to ~ l/i~^Mpc , the observed one-halo term 
limit established by Zehavi et al. (2004). These lower bounds are 
shown by the vertical dotted lines on Figure 1, and it is worth noting 
that no apparent excess is displayed at the smallest scales. 

The error-bars shown on Figure 1 are estimated from a jack- 
knife Monte-Carlo method. The error estimation for each scale is 
made following the bootstrap method. In each sub-sample, objects 
are randomly removed and duplicated from our position catalogue, 
and the two-point correlation function is measured again. The value 
of the correlation function at each scale is estimated from the mean 
of these bootstrap catalogues, and errors are derived from the vari- 
ance. We then derive the best fit amplitude for each of our fields 
at 1 degree using a Marquardt least-linear method, that take into 
account our bootstrap errors, and provide an estimate of the error 
on the fitting. We then derive a mean value from our six catalogues 
and over each of our three fields. Table 2 summarises the values 
measured for our samples. 

As explained in Section 2.2.3, the variance over the six cata- 
logues is used as an estimate for the systematic errors in the photo- 
metric redshift determination and the variance over the three field 
measurements is used as an estimate of the cosmic variance of our 
samples. Our total error budget is a quadratic sum of the fitting 
errors, photometric redshift errors, and cosmic variance errors. As 
discussed in Conselice et al. (2007), using models from Somerville 
et al. (2004), the cosmic variance on the number counts in the red- 
shift range we use (0.4 < z ^ 2.0) range from a„ ~ 0.1 — 0.2 
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Figure 1. The 2-point angular correlation function as a function of redshift for our mass-selected samples in our field 3 (see text). The open triangle, open 
square, open star and open circle symbols represent the measurements made for the mass selected samples at stellar masses IO^'^'^Mq < M* ^ IO^'^'^Mq , 
10^°-^ Mq< M, ^ 1O" °M0 , 1O" °M0 < M, ^ 10"-5Mq and 1O" °M0 < A/, sC IO^^^Mq , respectively. The best fitted lines for the 
correlation function are represented by the short-dashed, dashed-dotted long-dashed and continuous lines, respectively for the different mass samples as 
described above. The vertical dotted line corresponds to the lower limit at ~ Ih^^Mpc of the range over which our data are fitted, in order to avoid any excess 
of pairs due to multiple halo occupation. 
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for our less massive samples, up to (j„ ~ 0.3 — 0.5 for our most 
massive ones. This is comparable with the uncertainties we derived 
based on our field-to-field measurements on the clustering analysis. 

Despite the accuracy of our photometric redshifts, as de- 
scribed in Section 2.2.3, our samples, selected in different redshift 
bins, are possibly contaminated by galaxies at other redshifts. Any 
contamination will dilute the original clustering signal, and there- 
fore our measurements of the amplitude of the correlation func- 
tion could be underestimated. From our Monte-Carlo simulations, 
we estimate the maximum contamination at a level from 20% to 
30%, mainly due to galaxies from adjacent lower redshift bins (see 
Section 2.2.3 for a full description of this). This contamination is 
stronger at higher redshifts. However, this effect does not signifi- 
cantly alter our results. In the worst case, if the contaminating pop- 
ulation is uncorrelated, the value of A^i is diluted by a factor of 
1/(1 — /)^ (with / the contamination fraction), corresponding to an 
underestimation of our correlation amplitude by a factor between 
1.56 and 2.04 (for 20% and 30% contamination by other redshift 
galaxies). These factors are upper limits, and the true value of A^j 
is likely to remain within the error-bars quoted in Table 2. In any 



case, this contamination will only reduce the strength of the clus- 
tering, and again our measures are therefore at worst lower limits. 
This means that any errors in redshifts and/or stellar masses only 
dilute our signal, which would make the real clustering strengths 
even higher than the ones we observe. Furthermore such an effect 
is included in our error budget as we are taking the average of mea- 
surements over six catalogues with perturbed redshift distributions, 
as described in Section 2.2.3. This method should account for the 
errors due to interlopers. 

Moreover, our samples, divided into different stellar mass 
bins, are also subject to possible contamination from other stellar 
mass bins. Given the mass uncertainties of 0.2-0.3 dex (see Sec- 
tion 2.2.2), it is possible that galaxies more or less massive, and 
therefore more or less clustered, contaminate our stellar mass cuts. 
As mentioned in the Section 2.2.3, these effects are more difficult 
to constrain with our Monte-Carlo simulations. However within a 
given redshift bin, the measurements of the clustering for samples 
in adjacent stellar mass bins are very similar and overlap within 
their errorbars. The cross-contamination would therefore only have 
a minor impact, and in the sense of increasing the segregation. 



© 2010 RAS, MNRAS 000, 1-19 



Luminous and dark matter mass ratio for massive galaxies to z ^ 2 7 




Figure 2. (a) Correlation length (ro) as a function of redshift. The correlation lengths measured for our different samples are represented by the full circles 
with the size of the curie increasing with the stellar mass of the samples. We also show measurements performed by Meneux et al. (2008) for two mass 
ranges (Log [M» /(h^ggMQ)] = 10.22 and 10.70). The lines represent predictions from models by Mo & White (2002) for different minimum masses of 
dark matter haloes (plain lines for M^in = 10" °/i-1Mq , lO^^ O/j-i jvf^ and lQ^3.0f^-lj^^ _ dashed lines for M^in = 5 X W^^ °h-^MQ , 
5 X 10^^'^h~^ Mq and 5 X IO^^'^/i^^Mq ) while the dotted lines represent the predictions from models including halo distributions as described in Zheng 
et al. (2007). (b) Abundance (n) as a function of redshift. The abundances measured for our different samples are represented by the full circles with the size 
of the circle increasing with the mass of the sample. The lines represent predictions from models by Mo & White (2002) and Zheng et al. (2007) for different 
minimum masses of dark matter haloes (as described above). 



3.2 Spatial correlation lengths and bias 

In order to compare the clustering of galaxy populations at differ- 
ent redshifts we derive the spatial correlations for our galaxy sam- 
ples. We do this by using the spatial correlation function (^) (Groth 
& Peebles 1977) to derive ro, the co-moving galaxy correlation 
length, as defined by: 

^{r,z)^(^) ' (4) 

where 7 = 1 + 5. The redshift dependence is included in the co- 
moving correlation length ro{z). In general the larger the correla- 
tion length (ro), the more clustered the galaxies are. 

By measuring the redshift distribution for our samples we can 
also derive the correlation length ro from the amplitude of the angu- 
lar correlation A^i using the relativistic Limber equation (Maglioc- 
chetti & Maddox 1999). We estimate correlation lengths assuming 
that our galaxies are distributed as Top-Hat functions in each of 
our narrow redshift bins. Given the derived statistics of our sam- 
ples and the narrow redshift bins we are using here, and despite the 
accuracy of our photometric redshifts, changing the shape of the 
redshift distribution has a negligible impact on the results. 

The correlation length is derived from the amplitude of the an- 
gular correlation function in each of our redshift and stellar mass 
bins, using each of the six catalogues (the original and the simu- 
lated versions) for each of our three fields. The mean correlation 
length is then computed, as well as its associated error budget, 
as described in Section 3.1. In summary, we find that correlation 
lengths vary from 5/i^^Mpc to 15/i^^Mpc for galaxies selected 
by stellar masses with M, > 10^" . The highest correlation 
lengths are for the most massive galaxies at the highest redshifts 



2 = 1.6 — 2 (Figure 2(a)). In general, at lower stellar masses and 
redshifts, the correlation length becomes smaller. 

Table 2 summarises the values we calculate for the correlation 
length ro measured within our stellar mass selected samples. In 
Figure 2 we plot the correlation length, and the galaxy number den- 
sity, as a function of redshift for our stellar-mass-selected galaxy 
samples. We observe an apparent decrease in the correlation length 
with decreasing redshift, as is also observed in other previous stud- 
ies (e.g. Le Fevre et al. 2005). Here the main effect is the well- 
known luminosity-segregation, with brighter galaxies (less abun- 
dant) having a larger correlation length (Polio et al. 2006; Coil et al. 
2006). As a first rough approximation, this can be directly linked to 
a mass-segregation effect, with more massive galaxies more clus- 
tered. Previously, McCracken et al. (2008) showed that bright red 
galaxies, that are likely very massive, have clustering lengths that 
are almost invariant with redshift since 2 ~ 1, while less luminous 
(likely less massive) galaxies have correlation lengths that decrease 
with fainter magnitude at all redshifts. 

As shown in Figure 2(a), we compare our results with Meneux 
et al. (2008), who have derived correlation lengths from spatial cor- 
relation functions using the VIMOS-VLT Deep Survey (VVDS) 
data-sets. The mean redshift of their sample is 2 ~ 0.85, and for 
galaxies with masses Log [M./(/iJ"ooM0)]) = 10.22 they de- 
rived ro — 5.06 ± 0.49/i~^Mpc , and for galaxies with masses 
Log [M,/(fe-(foMQ)] = 10.70, ro = 6.21 ± 0.67/i"^Mpc . We 
note that these results are in good agreement within the error-bars 
of our measurements. 

As analysed in Section 3.1 a contamination of our different 
samples by galaxies in adjacent redshift and stellar mass bins will 
artificially decrease the measured value of their correlation length. 
This effect is more likely to occur within the higher redshift sam- 
ples, and thus the values summarised in Table 2 can be considered 
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Table 2. Summary of the measurements performed for each sample of galaxies selected in redshift and stellar mass. The mean redshift, the mean stellar mass, 
surface density, space abundance, amplitude of clustering at 1 degree with a slope of & = 0.8 (according to equation 2), the correlation length (according to 
equation 4), and the linear bias (according to equation 5) are compiled for each sample. 



as lower limits of the real intrinsic correlation length. Such an ef- 
fect has bin identified in previous analysis as well (e.g. Amouts 
et al. 1999; Quadri et al. 2008). The net effect of this is that our 
estimates of the total masses for these galaxies are also lower lim- 
its. However, even with a contamination fraction at the upper limit 
of 20% and 30%, as quoted in Section 3.1, the values of the cor- 
relation lengths we quote in Table 2 are only under-estimated by a 
factor ^ 1.28 and ~ 1.41 respectively. The relative uncertainties 
on our measured correlation length are of the same order as this cor- 
rection (Table 2). This is an upper limit of the correction that could 
be appUed, and furthermore our correlation length measurements 
and errors take into account these potential effects from interlopers 
as described in Sections 2.2.3 and 3.1. 

We also note that Hartley et al. (2008) have found that the 
errors on photometric redshifts can artificially broaden the redshift 
distribution which leads to an overestimate of the correlation length 
ro. However given the narrow bins we are using in the current anal- 
yses, our redshift distributions are more perturbed by objects scat- 
tered from one redshift bin to another, than from a broadening effect 
due to photometric redshift errors. 

In addition, we derive the linear bias of our sample, which re- 
lates the clustering of galaxies to that of the overall dark matter 
distribution (Magliocchetti et al. 2000). The rms density fluctua- 
tions of haloes (cr8,gai {z)) are linked to the rms density fluctuations 
of the underlying mass (cr8,m(«)) by the bias (6(2)) following: 



where o'8.m(,z) = <^»D{z) is the variance in 8/i~^Mpc spheres, as- 
suming that dark matter behaves as predicted in linear theory (lin- 
ear growth D[z) - Carroll et al. 1992), renormalised to the fiducial 
value Us (fixed to (78 = 0.9 in this study). If the galaxy correlation 
function is fit as a power law, then it can be integrated to give the 
relative variance in Sft^^Mpc spheres: txs.ga; (Peebles 1980). We 
find for our sample that the bias varies from 6 ~ 1.1 to & ~ 4.2 for 
galaxies selected by stellar masses M, > 10^° M0 .The most mas- 
sive galaxies at the highest redshifts 2: = 1.6 — 2 are more biased, 
in good agreement with biased galaxy formation models (Bardeen 
et al. 1986). Table 2 summarises the values of the bias b measured 
for our samples. 



4 TOTAL DARK MATTER MASS OF MASSIVE 
GALAXIES AT z < 2 

4.1 Modelling the dark matter halo correlation lengths and 
abundances 

The standard CDM model predicts that at any redshift more mas- 
sive dark matter haloes are on average more clustered than lower 
mass systems. To model this effect quantitatively, we use the pre- 
dicted effective bias, and the abundance evolution, as derived from 
the Mo & White (2002) formalism for different minimum dark 
matter halo (DMH) mass thresholds. This formaUsm is based on 
modelling the effective bias, which relates the mass fluctuations of 
haloes, to the mass fluctuations in spheres that contain an average 
mass M of underlying dark matter, according to equation 5. For 
this purpose, we use the halo abundance distribution as a function 
of mass and redshift determined by Sheth & Tormen (1999), de- 
rived from fits to large N-body simulations, and the linear halo bias 
for a given mass at a given redshift, calculated using the function of 
Sheth et al. (200 1 ) . We directly derive the effective bias by integrat- 
ing the ratio of these functions on masses above a given minimum 
mass Mmi 

A modelling of the correlation length evolution for DMHs of a 
given minimum mass Mmin at a given redshift z can be directly de- 
rived from the effective bias using the recipes described in Maglioc- 
chetti et al. (2000). We use also these recipes in the computation 
of the linear bias for our sample of galaxies, as described in Sec- 
tion 3.2. Details of this method can be found in the cited papers. 
The predicted correlation lengths and abundances evolutions as a 
function of redshift are shown in Figures 2(a) and (b), for different 
values of the minimum mass of DMHs, Mmin- 

We note that this model assumes that there is only one galaxy 
per DMH. Such an assumption can and often must be incorrect, 
especially in the case of the less massive galaxies in the low- 
est redshift bins. If satellite galaxies are taken into account, us- 
ing the same galaxy bias factor measured from the data, a dif- 
ferent Afmin would be inferred. In order to take into account the 
Halo Occupation Distribution (HOD) effect we follow the simple 
recipe provided in Zheng et al. (2007), by sunmung a step func- 
tion for central galaxies and a power law for satellite galaxies, 
{N{M)) = 1 + M/Mi for M > Mmin- As inferred in Zheng 
et al. (2007), and following galaxy formation model predictions 
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^"■^Redshift bin; Stellar mass bin in Log(M/MQ); ^'^^Mean redshift; ^'^^Mean stellar mass in Log{M/MQ); ^"^"i Minimum mass of the Dark Matter Halo 
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Table 3. Summary of the measurements performed for each sample of galaxies selected in redshift and stellar mass. The mean redshift, the mean steUar 
mass, the minimum masses of associated DMHs as inferred by the comparison between the DMH model and the correlation length,the DMH model and the 
abundances, and the correction factor to apply to take into account the halo distribution effect are compiled for each sample. In addition we add the value we 
use as the mass of the DMH and the stellar-mass-to-dark-matter-mass ratio, as inferred by the correlation measurements, and corrected for the halo occupation 
effect. 



(e.g. Zheng et al. 2005) and HOD modelling results (e.g. Zehavi 
et al. 2005), we use Mi = 20Minin- The modified correlation 
lengths and abundance evolutions are shown in Figures 2(a) and (b) 
with dotted Unes, for different values of the minimum mass Mmin. 



4.2 Dark matter masses of massive galaxies 

In Figures 2(a) and (b) we show the predicted relation, based on 
dark matter halo models and on Halo Occupation Distribution, be- 
tween the correlation lengths and the number densities of dark mat- 
ter haloes, and how this relation evolves with redshift. These pre- 
dictions are in good agreement with our observations. As expected, 
the mass segregation effect is present, with the correlation lengths 
derived for galaxies with higher stellar mass agreeing well with the 
predicted correlation lengths of haloes with the highest dark matter 
masses. 

We go a step further than this and associate the observed cor- 
relation lengths of our stellar mass selected samples with the pre- 
dicted value of their host halo masses, thereby providing an esti- 
mate of the typical halo mass for each stellar mass selected bin. 
To do this, we make the assumption that only one galaxy is hosted 
per halo. Later we apply a correction to take into account the halo 
occupation effect. Finally, we investigate the stellar masses of all 
galaxies within these haloes to obtain a measurement of the total 
stellar to halo mass within dark matter haloes. 

This total mass measurement from our clustering analysis 
gives us a measure of how massive the haloes that host massive 
galaxies are. To obtain these mass measures, we interpolate masses 
from a grid of predictions in correlation length/redshift space, from 
the models described in Section 4.1. Each observed correlation 
length at a given redshift is then assigned to a model with a given 
DMH minimum mass. This method has been used previously in 
papers such as Ouchi et al. (2004); Magliocchetti et al. (2008); 
Yoshida et al. (2008). The errors on the correlation lengths are prop- 
agated to obtain errors on the DMH minimum mass in the same 
way. A similar approach is used to estimate the mass of DMHs, by 
comparing model results with the abundances. Table 3 summarises 
the values derived for the DMH minimum mass A/min from their 
correlation length values and from their abundances. 



Our assumption of a single galaxy per halo can result in over- 
estimating the true value of the DMH minimum mass. Following 
Zheng et al. (2007), we estimate the effect of halo occupation using 
a simple prediction described in Section 4. 1. However as shown by 
Zheng et al. (2007), the effect of the "one galaxy per halo" assump- 
tion on the true measure of the DMH mass can be very important 
at the lower bias levels found in the lower redshift part of our cata- 
logue. The simple correction we apply is unlikely to be sufficient to 
correct for this effect, so we later discard these sub-samples from 
our following analysis, as shown in Table 3. 

More detailed models, such as HOD models (e.g. Wechsler 
et al. 2001; Zehavi et al. 2005), are likely more appropriate to de- 
termine the mass of DMHs associated with a sample of galaxies. 
For instance, in order to have a reliable estimate of the mass of 
the DMHs, such halo occupation models should fit directly from 
the angular correlation function determined for the samples in the 
lower redshift and stellar masses bins. However, for our higher red- 
shift samples with a strong bias, we have estimated our correlation 
amplitude on scales > l/i~^Mpc , where the effects of any ex- 
tra halo terms is avoided (see Section 3.1). Zehavi et al. (2004) 
have shown that one-halo term clustering is only observable on 
scales smaller than this limit. Therefore, our lower mass galaxies 
likely reside in the same haloes, but our measurements are made on 
scales where galaxies are in separate haloes. This should reduce the 
effect of multiple halo occupation on our estimation of the DMH 
mass even for the lower mass samples. One advantage in measur- 
ing clustering over large areas, such as in this study, is the ability 
to avoid the one-halo clustering regime, unlike in previous studies 
using smaller areas. 

As shown in Figures 2(a) and (b), the minimum mass of dark 
matter haloes, for a given sample of galaxies, can differ when es- 
timated from the correlation length measurements or through us- 
ing abundance measurements. Magliocchetti et al. (2008) raised a 
similar issue in their analyses, and showed that this discrepancy 
could also come from an erroneous a priori assumption of the one- 
parameter halo bias model for a one-to-one correspondence be- 
tween dark matter haloes and astrophysical sources. When we cor- 
rect for this effect the two estimations are broadly in agreement. 
The masses of DMHs derived from the comparisons between the 
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DMH model (without halo occupation effects corrected) and the 
correlation lengths and abundances are summarised in the Table 3. 

Figure 3 shows the mass of DMH inferred by the abundance 
measurements compared with the one inferred by the correlation 
length measurements, with both quantities corrected to account for 
halo distribution effects. The mild residual discrepancy between 
the masses estimated by comparing the models with the correla- 
tion length or the abundances, can be explained by several, likely 
combined, effects. 

First, and most likely, the contamination effects between the 
different bins of our sample could be at the origin of the dis- 
crepancy seen in Figure 3. As seen in Section 3.2, the correlation 
lengths we are using here are lower limits, due to possible con- 
tamination of the samples by interlopers. Therefore the masses of 
the DMHs we measure from the correlation lengths are also lower 
limits. However, this is what we want as the models provide the 
minimum mass of the DMHs, and using the lower-limits is consis- 
tent with this. Also, interlopers will affect the measurements of the 
abundances of our different samples. It is more likely that we are 
overestimating the number of objects in our less populated sam- 
ples, which are the more massive systems. In this case, the masses 
of DMHs derived from the abundances of our most massive stel- 
lar samples are likely underestimated. The combination of these 
two effects is likely the origin of the discrepancy between our two 
estimates, as shown in Figure 3. These contamination effects are in- 
cluded in our error budget as we discuss in Section 3.2, and, given 
the difficulty in obtaining a precise evaluation of the contamination, 
we do not correct our measurements directly for these effects. 

A second possible explanation, would be incorrect redshift 
distributions for our samples. Quadri et al. (2008) found a simi- 
lar discrepancy between the clustering and the abundance of their 
colour selected sample, and invoked an incorrect redshift distribu- 
tion as a likely origin of such discrepancy. In our case as we are 
using simple Top-Hat functions, given the narrow size of our red- 
shift bins, this effect relates directly to the contamination. 

Furthermore, Tinker et al. (2010) claim that such a discrep- 
ancy could originate from cosmic variance and incorrect models 
of the halo mass function and halo bias function. In our case, we 
take into account cosmic variance in our error budget, through the 
field-to-field variance, so we can partially reject this explanation. 
Finally, our models may be incorrect, despite being extensively 
tested. We recompute the model using a slightly modified halo bias 
function from the original function of Sheth et al. (2001), as pro- 
posed by Tinker et al. (2005). By changing the halo bias function 
we only modify the masses estimated from the correlation lengths. 
As shown in Figure 3, the masses inferred by this new model are not 
in better agreement, and the measurements remain almost within 
our original errorbars. We decided to keep the measurements made 
using the Sheth et al. (2001) function, as it is more commonly used. 
However other inputs in our models, such as the halo mass function, 
may also be at the origin of the discrepancy. 

Gao et al. (2005) have also shown that the clustering of DMHs 
depends not only on their halo mass but also on their assembly his- 
tory, and especially on their formation time. This "Halo Assembly 
bias" implies that, at a given mass, DMHs which assembled earlier 
are more clustered than DMHs that assembled later. This effect will 
increase the discrepancy between masses measured from the corre- 
lation length matching, and from the abundance matching. Croton 
et al. (2007a) have shown that the bias induced by the environ- 
mental dependence of halo formation history at a fixed halo mass 
is stronger for faint red central galaxies and weaker for bright red 
galaxies. The assembly bias enhances the two-point correlation by 
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Figure 3. Comparison between the masses of DMHs, as estimated using 
correlation lengths (ro) and abundances (n). The masses are corrected from 
the Halo Distribution effects. The plain symbol represent the masses esti- 
mated from the model using the halo bias function from Sheth et al. (2001), 
and the dotted symbols from the model using the halo bias function from 
Tinker et al. (2005). 



^ 10% for bright galaxies, and suppresses it by the same amount 
for fainter galaxies. However the relative strength in clustering be- 
tween our more massive and less massive samples is ~ 40%, much 
higher than the effect of the assembly bias itself. Furthermore, Gao 
et al. (2005) report that this effect is weakest for DMHs with masses 
Mdm > 10" /i-^M© which are within our range. 

The masses of the DMHs calculated by matching correlation 
lengths and abundances to the models both suffer from uncertain- 
ties. However as shown in Figure 3, the two estimates are broadly 
in agreement within our errorbars. We decided therefore to use 
the mass estimated from the correlation lengths corrected from the 
HOD effect, as we have a more complete error budget in this case. 



5 THE RATIO OF STELLAR AND DARK MATTER MASS 
FOR MASSIVE GALAXIES BETWEEN < z < 2 

5.1 History of mass assembly of galaxies in massive DMHs 

5.1.1 Stellar mass fraction in the central galaxy 

Since we know the median stellar mass within each of our sample 
cuts, and the clustering and hence total masses of these same galax- 
ies, we derive directly the stellar mass fraction as a function of total 
mass, and as a function of redshift for our sample. We report these 
values in Table 3, and display the results on Figures 4 and 5. 

In order to study the link between luminous and dark mat- 
ter we make several assumptions. First we assume that the mini- 
mum mass of the DMHs, determined as explained in Section 4.2, 
is an accurate approximation of the median mass of these DMHs 
(A/min — Mdm). As we are mainly studying DMHs with masses 
A/dm > IQi^/i^^Ma , and the mass function of DMHs drops 
rapidly above this limit, we conclude that this assumption is fair 
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Figure 4. Evolution of the stellar mass fraction for massive galaxies as a function of the dark matter halo mass (A/dm)- The measurements made for this study 
are shown as red open and filled circles, with bigger size symbols coiresponding to higher redshift samples. Different measurements taken from the literature 
are overplotted, with the size of the symbol bigger for samples with z > 0.7. The continuous line represents the baryonic fraction measured from WMAP5 
(Komatsu et al. 2009), and the dashed line represent the mean stellar fraction in the local Universe estimated by Cole et al. (2001). 



(Warren et al. 2006). Our second assumption, for comparison to 
other work, is that the virial mass of a system is equivalent to the 
total mass in the DMH, which has been confirmed by Busha et al. 
(2005) using a large N-body simulation. The third assumption we 
make, in order to compare our measurements with literature values, 
is that the mass in dark matter is equivalent to the total mass of the 
system. Indeed, the Mo & White (2002) models we use to estimate 
the mass of the DMHs are based on the behaviour of dark matter, 
and do not take into account presence of baryonic matter. Given 
that baryonic matter accounts for < 15% of the total density in the 
Universe (Komatsu et al. 2009), this last assumption is justified, as 
> 85% of the density of the system is made up of dark matter. 

As explained in Section 4.2, we corrected our halo masses for 
multi-occupation of the haloes. We also focus our analysis on the 
highly biased samples, those most likely to have a one-to-one cor- 
respondence between the galaxy and its DMH, sans any outer satel- 
lite galaxies, which is an issue we address later. We can therefore 
assume that we are looking at the stellar mass fraction of the central 
galaxies, and are mostly free of any effect due to satellite galaxies 
in the present analysis. 

To compare our results to previous work, we collected mea- 
surements of stellar and total masses from the literature based on 



various different methods to estimate the masses of various DMHs. 
These include total masses of disk-like galaxies based on rotation 
curves at 0.2 < z < 1.0, from Conselice et al. (2005b), as well as 
at 2 ~ 2.0, from van Starkenburg et al. (2008). The stellar masses 
used in these studies are estimated from SED fits based on spec- 
troscopically confirmed redshifts, and a wide range of photometry, 
similar to that described in Section 2.2.2, but by using a Salpeter 
IMF. We converted all stellar masses into a Chabrier IMF by divid- 
ing by a factor of ~ 1.5 (Chabrier 2003). 

We also added measurements of nearby early-type (ET) and 
late-type (LT) galaxies from galaxy-galaxy lensing studies in the 
Sloan Digital Sky Survey (SDSS) from Mandelbaum et al. (2006). 
Stellar masses for these SDSS galaxies are obtained from the 
method described in Kauffmann et al. (2003), based on the strength 
of the 4000A-break and the Balmer absorption index H(5,4, for 
a Kroupa (2001) IMF (which give estimates very close to the 
Chabrier (2003) IMF). Finally, we use the publicly available cat- 
alogue of groups in the SDSS from Weiiraiann et al. (2006) 
to estimate the mass ratio for galaxy groups in the local Uni- 
verse. Total masses of these groups are determined by ranking the 
completeness-corrected group luminosity, and by assigning to them 
the mass of the DMHs, ranked according to their abundances pre- 
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dieted by models from Mo & White (2002). The central galaxies 
of each group are assumed to be the most luminous galaxy in the 
group. The stellar masses for this sample are determined using the 
method described in Kauffmann et al. (2003). 

We add as a continuous line on Figures 4 and 5 the value of the 
mean baryonic fraction in the Universe, estimated from WMAP5 
(Komatsu et al. 2009), which is fb = Vtb/Q.ra = 0.165 ± 0.004. 
From a study of galaxies in the local Universe, Cole et al. (2001) 
measure the stellar mass density to be fi*/iioo = (2.9 ± 0.43) x 
10"^ for a Salpeter IMF If we scale this value to a Chabrier 
(2003) IMF, and combine it with the cosmological parameters from 
WMAP5 (Komatsu et al. 2009), we derive a mean stellar mass frac- 
tion in the Universe of /, = (2.03 ± 0.04) x 10~^ , which is shown 
in Figures 4 and 5 as a dashed line. The remaining part of the bary- 
onic fraction that is not included in stars is possibly present in the 
form of hot and warm gas, as seen in clusters (e.g. Arnaud & Evrard 
1999). 

Figure 4 shows that there is a good agreement between these 
very different mass measurements, especially at the massive end. 
The Mt/MuM ratio, even when estimated from completely dif- 
ferent methods, shows a clear decrease at higher halo masses in 
all studies. However, estimating accurate DMH masses is difficult, 
especially in the less massive cases. Dynamical estimates (rota- 
tion curves, e.g. Sofue & Rubin (2001); velocity dispersions, e.g. 
Balogh et al. (2007)) trace the innermost region of the galaxy, 
requiring extrapolation to estimate the total mass of the system. 
Weak-lensing requires the use of stacking methods to provide an 
accurate estimate of the total mass, and tend to be difficult to 
achieve in the case of less massive DMHs (e.g. Hoekstra et al. 2001 ; 
Mandelbaum et al. 2006). In our case, galaxies in less massive dark 
matter haloes are not strongly biased, making the assumption of a 
one-to-one correspondence between the galaxy and the DMH less 
appropriate, as mentioned earlier. However our method, based on 
clustering properties at large scales, is efficient enough to deter- 
mine the luminous-to-dark matter mass ratio for massive galaxies, 
and has the advantage of allowing us to estimate masses at higher 
redshifts, where for instance lensing cannot be used. 

Overall, we find that more massive systems have a lower ra- 
tio of stellar-to-halo-mass, as shown in Figure 4. This behaviour is 
seen in all of our measurements at l.Q < z < 2.0, as well as in the 
local Universe with the SDSS, and thus appears to be independent 
of redshift, and is perhaps universal. This relation implies that, at 
fixed total mass, the relationship between the mass of the central 
galaxy and its DMH does not evolve strongly with cosmic time. 
Physically this may imply that very massive galaxies struggle to 
increase their stellar mass in very clustered environments, resulting 
in a departure in the luminous-to-dark-matter mass ratio in massive 
DMHs compared to lower mass DMHs, which is below the aver- 
age value in the local Universe. These observations also imply that 
there is a limit to how much stellar mass a galaxy can have, with a 
cut off at a few times lO^^ilff, . This is consistent with the observed 
cut-off at high mass of the stellar mass function at low redshift (e.g. 
Baldry et al. 2008). 

Furthermore, a decreasing stellar mass fraction with increas- 
ing halo mass has been observed in groups and clusters (e.g. Eke 
et al. 2004; Lin & Mohr 2004; Gonzalez et al. 2007; Balogh et al. 
2007). Lin & Mohr (2004) show that Brightest Cluster Galaxies 
(BCGs) contribute a large fraction to the total light of their host 
cluster, but become progressively less important in the overall lumi- 
nosity budget in the highest mass clusters. This effect is explained 
by differential growth between the BCGs and their host clusters, 
due to clusters accreting nearby galaxies in lower mass groups. 



while the BCGs grow modestly by merging or carmibalism (Whiley 
et al. 2008). Similarly we can imagine that the clusters will ac- 
crete more dark matter in a similar process while the central galaxy 
grows more slowly. 

5.1.2 The stellar mass fraction in satellite galaxies 

In Section 5.1.1 we focussed our analysis on the central galaxy of 
the DMH. Indeed, as explained in Section 4.2, our clustering mea- 
surements are made in a way where we can assume there is only one 
massive galaxy per DMH, from which we measure the mass of the 
host halo. However, an overabundance of satellite galaxies has been 
observed in massive DMHs (e.g. van den Bosch et al. 2005; Yang 
et al. 2005). It is therefore possible that a non-negligible fraction 
of the stellar mass in the most massive DMHs resides in satellite 
galaxies (e.g. Lin & Mohr 2004; Balogh et al. 2008). 

To study this possibility in more detail we use the SDSS group 
catalogue (Weinmann et al. 2006), as it provides stellar masses for 
each member of each group. We limit our analysis of these groups 
to those at 2 < 0.06 in order to be complete at stellar masses 
M, > lO^^^Afr; , and at 2 < 0.045 for M, > IO^'^Mq . We plot 
the ratio of the total stellar mass in the SDSS group DMHs to the to- 
tal mass of the DMHs in Figure 6. As shown in this figure, the lim- 
ited completeness of the sample does not allow us to conclude if the 
mass in satellite galaxies can account for the totality of the "miss- 
ing" stellar mass in the DMH. However, the stellar mass distribu- 
tion of satellite galaxies in the SDSS, hosted by groups with DMHs 
masses of lO^^ "/i"^M0 < Mom < lO^^'^hr^MQ , is shown in 
Figure 7. This study is limited to z < 0.06, and shows a plateau 
before reaching the completeness limit of M* > 10^" " Af(5 . This 
flattening at the faint end is similar to the observations of the lumi- 
nosity function of the Local Group (e.g. Pritchet & van den Bergh 
1999). 

Furthermore, a decrease in the stellar-mass fraction is also ob- 
served in local clusters. Vikhlinin et al. (2006) observed a defi- 
ciency of X-ray gas in galaxy groups compared to clusters. Gon- 
zalez et al. (2007) explain these observations by the fact that the 
missing X-ray gas has cooled in less massive systems to form stars. 
This implies that a larger fraction of the baryonic matter in clusters 
is in the form of hot gas in more massive systems (e.g. Arnaud & 
Evrard 1999). However, while the total mass can be derived from 
the virial mass for these massive clusters, the total stellar mass is 
harder to estimate. A non-negligible amount of the light emitted by 
a cluster resides in its Intracluster Light (ICL), as dynamical pro- 
cesses can strip stars from galaxies which orbit in the intracluster 
space (e.g. Lin & Mohr 2004; Gonzalez et al. 2007, and references 
therein), making the estimate of the total stellar-mass in a cluster 
difficult to measure accurately. 

In our study of the stellar-mass fraction from the SDSS group 
catalogue, given the likely incompleteness in establishing group 
membership, we can only conclude that there is a weak trend in 
which the stellar-mass fraction in more massive systems is lower 
than in less massive ones, as shown in Figure 6. Further study of 
the role of satellite galaxies in the total amount of stellar-mass in a 
DMH is required to fully understand this result. 



5.1.3 "Halo downsizing ": a possible scenario for the formation 
of massive DMHs 

Figure 5 shows that, at fixed stellar mass, the stellar mass fraction 
increases at lower redshift. In other words, the stellar component 
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Figure 5. Evolution of the stellar mass fraction of massive galaxies as a function of redshift. The measurements made in this study are shown as red open and 
filled circles, with bigger sized symbols coiresponding to more massive samples. Measurements taken from the literature are oveiplotted, with larger symbols 
for samples with M, > 10^^ A/q . The lines representing the baryonic mass fraction and the local stellar mass fraction are the same as described in Figure 4. 



apparently grows more rapidly than the dark component between 
l.Q < z < 2.0 for galaxies with M, > lO^^M© . However, 
as shown in Table 3, the average stellar mass in each stellar mass 
bin does not change significantly with cosmic time, while the av- 
erage DMH mass decreases by a factor ~ 5 within these stellar 
mass selected samples. Furthermore, the star-formation rate does 
not change dramatically in the redshift range 1.0 < z < 2.0 (e.g. 
Perez-Gonzalez et al. 2005; Tresse et al. 2007; Reddy et al. 2008), 
and the high-mass end of the stellar mass function does not evolve 
significantly below z ^ 1 — 2 (e.g. Drory et al. 2005; Bundy et al. 
2006; Fontana et al. 2006; Pozzetti et al. 2007; Marchesini et al. 
2009). This suggests that, instead of an increase of the average 
stellar mass, a statistical process is responsible for an apparent de- 
crease of the average mass of massive DMHs between z ~ 2.0 and 
z ^ 1.0 for galaxies with stellar masses 

We propose that these observations for very massive galaxies 
{M, > 1O" °M0) can be explained by a "halo downsizing" effect 
(e.g. Neistein et al. 2006). We assume that the most massive galax- 
ies are already in place at z ~ 2 in the most massive DMHs, in 
terms of their stellar material. Gradually the stellar mass bin we se- 
lect is populated by galaxies that have gained stellar mass with time 
but that are hosted by less massive DMHs than the older systems 
we find at higher redshifts. Meanwhile the most massive galaxies 



do not gain in stellar-mass, nor in DMH mass, due to a quenching 
of their star formation and a weak merging rate (Drory & Alvarez 
2008). As a consequence the average mass of the DMHs for these 
galaxies decreases, and the stellar mass ratio increases, which is 
what we find. 

Such an idea is supported by the "Archaeological downsizing" 
(Thomas et al. 2005), and the observation that the most massive 
galaxies appear to have assembled their stellar mass earlier than 
younger galaxies. Galaxies with M, > IO^^'^Mq assembled half 
of their stellar mass before z 1.5, and more than 90% of their 
mass was already in place at z ~ 0.6 (Bundy et al. 2006; Perez- 
Gonzalez et al. 2008). Such observations imply that the mechanism 
responsible for quenching star-formation in galaxies is strongly 
mass-dependent and that it occurs earlier in the most massive 
galaxies (Bundy et al. 2006, 2008). Using semi-analytical models, 
Cattaneo et al. (2008) have shown that the downsizing effect ob- 
served for red galaxies, could naturally result from a shutdown in 
star-formation. More massive central galaxies in today's Universe 
formed earlier and over a shorter period of time than less mas- 
sive galaxies. A critical mass of DMHs of A/dm ~ 10^^/i"^A/o , 
above which a shutdown in star-formation occurs, is also observed 
in these studies, corresponding roughly to the mass where we ob- 
serve a decline in the stellar-to-dark-matter-mass-ratio in Figure 4. 
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Figure 6. Evolution of the stellar mass fraction for massive galaxies as a function of dark matter mass (A/dm)- The values for our measurements, and previous 
values from the literature shown in Figure 4, are represented by the dots (small for z < 0.7 and enlarged for z > 0.7). The full squares connected by a full 
line represent the stellar fraction for the central galaxies in SDSS groups, and the long-dash-dotted Unes represent the stellar fraction for all galaxies in SDSS 



groups at different completeness limits of Af, 



10^ 



Mq and M« = 10 Mq , corresponding to redshift limits of z < 0.06 and z < 0.045 respectively. 



The two thick straight dotted lines represent slopes of the evolution of the total stellar fraction (in central and satellite galaxies as well as in the ICL) with 
DMH masses as detailed in Balogh et al. (2008), and renormalised arbitrarily for more clarity, as we are only interested in comparing slopes. The green, red 
and blue lines represent the predictions from the De Lucia & Blaizot (2007) model for the central galaxies at different redshifts (resp. z = 0, < z < 0.7 
and z > 0.7). The shaded red region represents the rms scatter of the predicted stellar mass fraction for the central galaxies at < z < 0.7. The red 
short-dash-long-dash line shows the prediction from the model for the total stellar mass fraction estimated from the De Lucia & Blaizot model. The horizontal 
continuous line represents the baryonic fraction measured from the WMAP5 analysis (Komatsu et al. 2009), and the horizontal dashed line represents the 
mean stellar fraction in the local Universe as measured by Cole et al. (2001). 



As mentioned in Section 4.2, the "Assembly bias" effect could 
also mimic the observed increase of stellar-mass ratio with lower 
redshifts. Croton et al. (2007a) have shown that, at fixed halo 
mass, more massive and clustered central galaxies tend to occupy 
haloes that formed earlier, while less clustered central galaxies oc- 
cupy haloes that formed later. However, Gao & White (2007) have 
demonstrated that this effect is unobservable for massive DMHs 
(A^DM ~ 10^'^ Mq ). Although we cannot rule out that assem- 
bly bias could artificially increase the ratio of stellar to dark matter 
mass which we observe, this effect is likely not large enough to 
account for the observed evolution. 

The "halo downsizing" effect could also be used to explain the 
observation that galaxies in more massive haloes contain a lower 
stellar-mass fraction than those in less massive haloes, as seen in 
Figure 4. A tentative explanation of the formation of massive galax- 



ies could be that at higher redshift the most massive DMHs formed 
rapidly by accretion or mergers (Conselice et al. 2003). This would 
provide galaxies less time to form stars and become more subject 
to a shutdown of their star-formation, compared with less massive 
DMHs (Bundy et al. 2006; Perez-Gonzalez et al. 2008). Moreover, 
the gas accreted in the more massive DMHs is hot, which cannot 
cool quickly in such massive potentials, suppressing further the 
star-formation in these massive galaxies, as observed for groups 
and clusters (Gonzalez et al. 2007). 

The main difficulties in testing directly the scenario described 
above are in measuring accurately the stellar and dark matter 
masses involved. We assume within our clustering-derived masses, 
and through the other mass estimations, that masses of DMHs, dy- 
namical masses, and total masses, are equivalent. However, it is 
not always easy to compare dynamical masses with the often used 
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Figure 7. The distribution in stellar mass for satellites in DMHs of masses 
lO" O/i-lAf0 < A/dm < lO^^.s^-i^^^ r^he: histogram in black rep- 
resents the distribution as observed in SDSS groups at z < 0.06, and the 
histogram in red is the distribution in the Millennium simulation. The ver- 
tical dashed line represents the completeness Umit in stellar mass of the 
SDSS at 2 < 0.06. 



M500 and M200 masses, derived for a radius of a system corre- 
sponding to overdensities of 500 and 200, relative to the critical 
density at a given redshift. Furthermore, dynamical masses are ex- 
tremely difficult to measure in low mass systems, particularly at 
high redshift. On the other hand, the total amount of stellar mass 
and gas mass in the most massive systems is also difficult to mea- 
sure, and requires extensive observations. For instance, a deeper 
study in the near-infrared bands of local groups and clusters is re- 
quired to obtain a better measure of the stellar mass fractions in the 
satellites galaxies and in the ICL. In addition, a better estimate of 
the gas content of clusters and groups, through very detailed X-ray 
studies of a large sample, would be extremely valuable. 



5.2 Modelling the mass assembly of massive DMHs with 
semi-analytic models 

5.2.1 Modelling the history of the central galaxy 

Semi-analytic models are known to struggle in reproducing the ob- 
served formation of the most massive galaxies, pushing the for- 
mation of these systems to very late epochs (e.g. Kauffmann & 
Chariot 1998; Conselice et al. 2007). By introducing AGN feed- 
back (e.g. Springel et al. 2005a; Bower et al. 2006; Croton et al. 
2006) the most recent models now manage to reproduce a star for- 
mation "downsizing" effect, implying that the stars in more mas- 
sive galaxies form early in the Universe (De Lucia et al. 2006). In 
order to check if these models are able to reproduce the behaviour 
we observe between stellar mass fraction and total mass, how this 
fraction evolves with redshift, and to better understand the under- 
lying physical processes involved, we compare our results to those 
from one of the current leading models. We use the Millennium 



simulations (Springel et al. 2005b) in which were implemented the 
semi-analytic models developed by De Lucia & Blaizot (2007). 

The Millennium model predictions for the stellar mass frac- 
tion of the central galaxy of DMHs, at different redshifts {z — 0, 
< 2 < 0.7, z > 0.7), are shown in Figure 6, with the dashed 
area showing the rms for predicted DMHs at < z < 0.7. 
As expected, the peak in the mass ratio vs. halo mass occurs at 
A/dm ~ 10^^h~^ Mq , where star formation is known to be the 
most efficient. In less massive DMHs the infall time of gas clouds 
is too long to form stars efficiently, while in more massive haloes it 
is the cooling time of the gas cloud that is too long (White & Frenk 
1991). By construction, the mean value of this ratio in the models is 
similar to the observed stellar-mass fraction in the local Universe. 
The model from De Lucia & Blaizot reproduces well the behaviour 
observed for the central galaxies in the most massive DMHs at low 
redshifts. However, at higher redshifts the agreement is not as good, 
with the model showing lower stellar mass fractions compared to 
our observations. These models seem to not form massive galaxies 
quickly enough to reproduce the observations (see also Conselice 
et al. 2007, for this comparison done on stellar mass densities). 

Previously, Kitzbichler & White (2007) used De Lucia & 
Blaizot's models to study the high-redshift galaxy population. They 
compared the observed mass functions from Drory et al. (2005) and 
Fontana et al. (2006) with simulations, and found an overall over- 
prediction by the simulations between 1 < z < 3 in the mass 
range 10^°A/q < Af* < lO" Af© . On the other hand their predic- 
tion shows a deficiency in this redshift range for the most massive 
galaxies with Af * > 10^^ A/q . Bower et al. (2006) use a different 
semi-analytic model with a different recipe for reproducing AGN 
feedback, also based on the Millennium simulation, but find re- 
sults very similar to the De Lucia & Blaizot model (Kitzbichler & 
White 2007). By comparing the mass functions derived from their 
models with the measurements by Drory et al. (2005), Bower et al. 
also find an underprediction of the number densities for the most 
massive galaxies at high redshifts. Using the sample of A'-band 
selected galaxies at z ~ 0.4 — 2 presented here, Conselice et al. 
(2007) confirm this result by studying the mass growth of the most 
massive galaxies. These underpredictions in the number of massive 
galaxies at higher redshift by the semi-analytic models is likely re- 
sponsible for the disagreement seen in Figure 6. On the other hand, 
observed mass errors in massive systems can be partially respon- 
sible for the overestimation at the steep end of the mass function 
(e.g. Kitzbichler & White 2007). 

Furthermore, Conroy & Wechsler (2009) introduced a simple 
model based on N-body simulations and abundance matching be- 
tween galaxies and DMHs that can shed some light on this prob- 
lem. This model is tuned a posteriori to reproduce the observed 
evolution of the stellar-mass function and star-formation rate his- 
tory within galaxies. With this simple model, Conroy & Wechsler 
(2009) reproduce the shape of the stellar mass fraction as a func- 
tion of DMH mass as shown in Figure 6. Their model does not 
address explicitly, by construction, the growth history of galaxies 
more massive than Af» > lO^^Af0 . However, their model infers 
a shift in the masses of DMH for which the stellar mass fraction 
peaks from higher mass at high redshift, to lower mass at lower 
redshift {Mdm ~ W^^h'^MQ at « ~ 2 to ~ 10" ''/i"^ A/q at 
z ~ 0), where the Millennium simulation predicts the opposite. 
This model favours a range of characteristic masses of DMHs for 
which a shutdown in star-formation occurs, and is not defined for 
DMH masses of A/dm ~ W^^h'^AlQ (e.g. Cattaneo et al. 2008). 
Our current dataset can not address this issue, but larger and deeper 
surveys should be able to observe such a phenomenon. 
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5.2.2 Stellar fraction in massive DMHs 



As mentioned in Section 5.1, a non-negligible amount of stellar 
mass within massive dark matter haloes could be contained in satel- 
lite galaxies. We investigate this idea further to determine the actual 
stellar to halo mass ratio for our most massive systems. We also 
use the available semi-analytic catalogue derived from De Lucia & 
Blaizot (2007), in which central and satellite galaxies are differen- 
tiated, to estimate the total stellar mass in each simulated DMH. We 
do this by adding the stellar mass included in satellites to the mass 
of the central galaxies. We find that in high mass DMHs the stellar 
mass fraction almost matches the value of the mean stellar fraction 
in the Universe, as shown by the red long-dash-short-dashed line in 
Figure 6. This prediction of a nearly constant stellar mass fraction 
is not surprising, as it is an effect of the construction of the semi- 
analytical models. The most massive systems (groups and clusters) 
in the models are built from DMHs with a similar mass distribution, 
and over a similar time-scale. They will therefore form a similar 
amount of stars, resulting in a similar stellar-mass fraction. 

In Figure 6 we show the predicted evolution of the stellar mass 
fraction with mass for DMHs derived by Balogh et al. (2008) us- 
ing the model of Bower et al. (2006), with a nearly horizontal dot- 
ted line with a slope of ciLog(M,/MDM)/dLogAfDM = —0.05. 
The model from Bower et al. (2006) leads to similar conclusions 
as De Lucia & Blaizot (2007). Furthermore, the trend found within 
the SDSS groups, with a lower mass completeness limit of hh ~ 
10^' ^Mq , seems to converge to the slope determined by the mod- 
els. On the other hand, Balogh et al. (2008) derived an estimate of 
total stellar masses and dynamical masses from the sample of clus- 
ters from Lin & Mohr (2004) and Gonzalez et al. (2007), both tak- 
ing into account intracluster light. The slope derived from their con- 
servative model, dLog{M:t /Mum) / dLogMuM ~ —0.35, does 
not match the stellar mass fraction from the SDSS groups. This 
could indicate an underestimate of the total stellar-mass in their 
study. 

In order to verify if the stellar mass fraction derived from the 
SDSS groups, and the predictions from the De Lucia & Blaizot's 
model, are in good agreement, we show in Figure 7 the distribution 
in stellar mass of satellite galaxies selected in DMHs of masses 
lO" °/i"^M0< Mom < lO"'^ft"^M0 from the SDSS. This 
figure shows that the satellite galaxies in the SDSS groups reach 
a plateau before the completeness limit of i\f, = lO^'^i\f0 , while 
the number of satellites in the simulations keeps rising. Such a flat- 
tening at the faint-end is observed in the luminosity function of 
the Local Group (e.g. Pritchet & van den Bergh 1999), confirming 
that simulations fail to reproduce the luminosity/mass function of 
satellites galaxies in groups. This is part of the well known missing 
satellite problem in CDM (e.g. Moore et al. 1999). At the complete- 
ness limits of the SDSS data there are twice as many satellites per 
DMH in the simulations than in the observations. Within the most 
massive part of this distribution it appears that the simulations lack 
massive satellite galaxies when compared to the observed SDSS 
groups. This is more likely due to the limited-volume of the sim- 
ulations, but could also be due to the lack of massive galaxies in 
the overall mass functions as discussed above. However, the obser- 
vations contain many uncertainties as well. Estimating the stellar- 
masses of faint objects is still tricky, and the group membership of 
satellite galaxies, especially at the faint end, is difficult to ascertain. 
To solve this issue, a deeper study of groups and clusters is required 
to estimate, in a better manner, their total stellar and dark-matter 
masses. 

Finally, Figure 8 shows the variation of the stellar-mass frac- 
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Figure 8. Evolution of the stellai' mass fraction for massive galaxies with 
redshift. The values from our measurements and the literature, as shown 
in Figure 5, are represented by the dots (small for A/* < 10^^ Mq and 
enlarged for M« > 10^^ Mq ). The triangle symbols connected by a con- 
tinuous line represent the measurements made in this study for galaxies with 
stellar masses 10^^ M© < M* < 10^^ Mq . The red solid line represents 
the predictions from the De Lucia & Blaizot (2007) model for the central 
galaxies at < z < 0.7 with stellar masses of M, < 10"AfQ . The 
shaded red region is the rms scatter of the observed stellar mass fractions 
for the central galaxies at < z < 0.7. The red dash-dot line shows the 
prediction from the model for the total stellar mass fraction. The horizontal 
lines represent the baryonic mass fraction, and the local stellar mass frac- 
tion, as described in Figure 6. 



tion with redshift predicted from the simulation compared to the 
data themselves. This figure shows that no variation of the stellar 
mass fraction with redshift is predicted by the simulation for galax- 
ies with stellar masses A/, < IO^^A/q . The N-body simulations 
present a very tight correlation between the mass of the DMHs and 
the number of galaxies hosted by each DMH at any epoch, and are 
in this sense very close to Smooth Particle Hydrodynamic simula- 
tions and analytic Halo Occupation models (e.g. Berlind et al. 2003; 
Zheng et al. 2005; Weinberg et al. 2008). Therefore, the stellar mass 
fraction in a given range of stellar mass is constant with epoch, and 
by construction equal to the mean value in the local Universe, ex- 
plaining their failure to reproduce the observed increase in stellar 
mass fraction with lower redshift. Moreover, as shown in Figure 8, 
the effect of the fraction of mass in satellites is negligible here as 
well. However, because of the limited volume of the simulations, 
the number of galaxies with Al, > IO^^A/q does not allow for a 
direct comparisons with our observations. Simulations over larger 
volume are required, and observations of the stellar mass ratio of 
less massive galaxies at 1.0 < z < 2.0 would be very useful as 
well. 



6 SUMMARY 

We present in this paper an analysis of the clustering properties of 
a stellar mass selected sample of galaxies over 0.7 deg^ taken from 
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the Palomar Observatory Wide-Field Infrared Survey (POWIR). By 
utilising information from three separate fields, we measure the 
two-point correlation function, and based on this we derive the spa- 
tial correlation length and bias for our sample of galaxies within 
various stellar mass bins at M, > 1O^°M0 . Our major results are: 

I. We find that the correlation length (ro) varies with both red- 
shift and stellar mass. We find that the largest correlation lengths 
(most clustered systems) are found for galaxies at the highest stel- 
lar masses with 10"Mq < M* < lO^^M© at « ~ 2. The corre- 
lation length for galaxies at all stellar mass selections are found to 
decrease at lower redshift. 

II. We derive the dark matter halo masses for these stellar mass 
selected systems utilising the correlation lengths and models from 
Mo & White (2002). Due to a likely contamination between our 
samples, the correlation lengths and therefore the dark matter halo 
masses are lower limit estimates. However we take into account this 
effect in our error budgets. We find that halo masses for our galaxies 
selected by > 10^°-*M© are a factor of 30 to 170 times the 
stellar masses of each galaxy. This is in very good agreement with 
other methods for measuring halo masses, including lensing and 
kinematics. 

III. We find a remarkable relation between dark matter halo mass 
(Mdm), and the ratio of stellar-to-dark-matter-mass (M«/Mdm) 
for the central galaxy in the halo, and find that this relation does 
not vary much with redshift. This correlation is such that the more 
massive a dark matter halo mass Mdm is, the lower the ratio of 
stellar-to-halo-mass (M*/Mdm). We further find that this correla- 
tion exists over at least two orders of magnitude from halo masses 
Mdm = 10^^" to 10""/i~^Mq . This is true for all methods of 
determining halo masses, from lensing, kinematics, and clustering. 

IV. Within our massive galaxy sample at z ~ 1 — 2 we find that the 
ratio of stellar-to-halo-mass increases from high to low redshifts. 
This correlation implies that at a given stellar mass selection, the 
average underlying mass of the host halo decreases at lower red- 
shifts. This may imply that the most stellar massive systems at the 
highest redshifts are hosted by very massive haloes, but at lower 
redshift they stop forming new stars, and therefore do not increase 
their stellar mass, while other systems in the lower mass dark mat- 
ter haloes grow to reach similar stellar mass. This is an example of 
"Halo downsizing". The 'Halo Assembly bias' implying that clus- 
tering depends not only on the mass of the dark matter halo, but 
also on the assembly history and environment of the galaxy, can 
also contribute partially to this observed increase in the stellar mass 
fraction with redshift. 

V. We compare our results to the semi-analytic models built using 
the Millennium simulation. We find that there is roughly a good 

agreement between models and the data for the central galaxy. 
The predicted value of the total-stellar-to-total-halo mass ratio de- 
creases only slightly with increasing halo mass, also in rough agree- 
ment with our observational results. We use the SDSS group cata- 
logue from Weinmann et al. (2006) to examine whether the miss- 
ing stellar mass can be accounted for by observed satellite galaxies. 
We find that, even though the overall agreement is good, the model 
marginally overpredicts the number of faint galaxies in comparison 
to the SDSS catalogue. Moreover the Millennium simulation fails 
to reproduce the "halo downsizing" effect we observe. We argue 
that this missing mass, in the more massive haloes, is most proba- 
bly in the form of baryons in a warm/hot phase. 
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